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We consider the problem of carrier frequency offset (CFO) estimation for a two-way relaying system based 
on the amplify-and-forward (AF) protocol. Our contributions are in designing an optimal preamble, and the 
corresponding estimator, to closely achieve the minimum Cramer-Rao bound (CRB) for the CFO. This optimality 
is asserted with respect to the novel class of preambles, referred to as the block-rotated preambles (BRPs). This 
class includes the periodic preamble that is used widely in practice, yet it provides an additional degree of design 
freedom via a block rotation angle. We first identify the catastrophic scenario of an arbitrarily large CRB when 
a conventional periodic preamble is used. We next resolve this problem by using a BRP with a non-zero block 
. rotation angle. This angle creates, in effect, an artificial frequency offset that separates the desired relayed signal 

m . 

. from the self-interference that is introduced in the AF protocol. With appropriate optimization, the CRB incurs only 

o 

£Nl ■ marginal loss from one-way relaying under practical channel conditions. To facilitate implementation, a specific 



low-complexity class of estimators is examined, and conditions for the estimators to achieve the optimized CRB 
is established. Numerical results are given which corroborate with theoretical findings. 
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I. Introduction 

Two-way relaying is a spectrally efficient communication technique for two sources to exchange 
independent data Q, 0. We consider two-way relaying based on analogue network coding, also known 
as the amplify-and-forward (AF) scheme with two transmission phases. In the first phase, both sources 
concurrently send their data to the relay, while in the second phase, the relay sends a scaled version 
of the received signal to both sources. Due to the sharing of spectral resources, the signal sent by one 
source is delivered not only to the other source, but also back to itself as self-interference through the 
relay. Assuming each source has knowledge of the two-way relay channel via channel estimation Q, each 
source can subtract the self-interference and thus can recover the desired signal sent by the other source. 

A potential application of two-way relaying for high data-rate transmissions over wireless channels is 
in multi-carrier systems, such as orthogonal frequency division multiplexing (OFDM) systems. It is well 
known that the presence of a carrier frequency offset (CFO) between the source and the destination can 
severely impair system performance. Hence, it is critical to consider the problem of CFO estimation in two- 
way relaying 01, (5]|, so that the detrimental effect can be mitigated at the receiver. Typically, a preamble 
is used to facilitate the estimation of the CFO. In practical implementations, the CFO is estimated without 
any prior knowledge of the channel as it constitutes the first step in most communication systems, such 
as in the IEEE 802.1 In Standard (6||, before any channel estimation is performed. 

Given knowledge of the preambles used, CFO estimation in two-way relaying is fundamentally different 
from the classical problem of CFO estimation in point-to-point channels, or even in one-way-relaying. This 
is because in practice the channel is not known exactly and hence the removal of the self-interference is not 
straightforward. Such self-interference corrupts the desired relayed signal and causes the CFO estimator 
to perform poorly. Despite some recent progress in this direction [H, (5]|, the fundamental reason for 
the loss in performance, if any, is not clear. In this regard, the Cramer-Rao bounds (CRB) serves as an 
important metric, since it gives the lowest possible variance for any unbiased estimator Q. 

To understand the potential problem caused by self-interference, it is insightful to consider the following 
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toy problem. Suppose we wish to estimate the frequencies of two tones received with unknown amplitudes 
and phases in the presence of additive white Gaussian noise (AWGN). Here the unknown amplitudes and 
phases represent the channel distortions. The CRBs for the estimation of both frequencies turn out to be 
arbitrarily large as the frequencies approach each other (8]|. Compared to the two-way relaying case where 
one frequency (corresponding to the CFO due to the desired relayed signal) is unknown while the other 
frequency (corresponding to the self-interference) is zero, this toy problem is a harder problem because 
both frequencies are unknown and to be estimated. However, it captures the essence of the CFO estimation 
problem in two-way relaying, in that two different signals carried by unknown channels are present. In 
fact, we shall see that both problems share the same fundamental limitation, namely that the CRB goes 
to infinity as the difference in the carrier frequencies approaches zero. This motivates a re-design of the 
preamble used for CFO estimation, so as to remove this fundamental limitation. 

Although the problem of preamble design in point-to-point channels for CFO estimation has been 
considered in the literature, e.g. JH, surprisingly there appears to have no such work in two-way relay 
systems. In this paper, we will introduce a novel preamble design that in effect introduces an artificial 
frequency offset to remove the fundamental limitation that we have identified. 

Our specific contributions are as follows. We consider the problem of preamble design and CFO 
estimation in a two-way relaying system, assuming a time-invariant multipath wireless channel. 

• We establish that reusing conventional periodic preambles at both sources, such as that used in the 
IEEE 802.11 standards [6]|, can result in an unbounded CRB for the CFO estimator. 

• To overcome the above problem, we propose the novel class of block-rotated preambles (BRPs) for 
CFO estimation in two-way relaying. The BRP includes the periodic preamble as a special case, yet 
provides an additional degree of design freedom via a block rotation angle. Intuitively, the block 
rotation angle introduces an artificial block-level frequency offset that enables the preambles from 
the two sources to be well separated in the frequency domain. 

• We obtain the CRB based on the class of BRP for the cases where the channel is either known a 
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priori or not. To obtain a closed-form expression, we use an approximation of the CRB to optimize 
the BRP. Under this approximation, we show that the CRB for two-way relaying can approach the 
CRB for one-way relaying, i.e., the CRB where no self-interference is present. 

• To facilitate implementation, a specific class of estimators, based on linear filtering followed by 
conventional CFO estimation used in point-to-point transmission, is proposed. The necessary and 
sufficient condition for this class to achieve the one-way-relay CRB is derived. 

• Numerical results are obtained which corroborate with our analysis, and which illustrate the tightness 
of the approximations made for the BRP design. 

This paper is organized as follows. The system model for the two-way relay is developed in Section [Ell 
The BRP is proposed in Section |inj The corresponding CRB is obtained in Section [TV] assuming some 
knowledge of the channel is available or none; an approximation of the CRB is also provided. Next, the 
BRP is optimized in Section |VJ Section [VT] proposes a low-complexity linear filter that does not suffer 
any loss in the CRB. Simulation results are given in Section IVII1 and finally conclusions are made in 
Section ETUI 

Notations: Boldfaces in capital and small letters are reserved for denoting matrices and vectors re- 
spectively. All indices in matrices and vectors start from zero. The symbols <g>, E 5 (-), I n and mxn 
represent convolution, Kronecker product operation, expectation function over the variables in the set S, 
n x n identity matrix, and m x n null matrix, respectively. In general, we collect the set of N signals 
{r n , n — 1, • • • , N} as a column vector r = [r l5 • • • , r N ] T . 

II. System Model 

We consider the two-way relay system consisting of one relay and two sources Si and 5*2, referred to by 
the subscripts 0, 1 and 2, respectively. The relay and sources S\ and S 2 transmit at carrier frequencies f , fx 
and fa, respectively. The carrier frequencies are set close to a pre-assigned value, but typically deviate 
slightly from one another due to hardware imperfections. For the link from node i to node j, we assume 
a L-tap frequency selective channel modeled by the finite-impulse response (FIR) hij tn ,n = 0, • • ■ , L— 1. 
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Fig. 1. Two-way relay system model from the perspective of source Si. 



The actual maximum number of the links can be less or equal to L without loss of generality. For notational 



convenience, we collect the L channel taps for the ijth link as = [h 



ij,0i 



h 



ij,L-l\ 



We employ the amplify-and-forward protocol consisting of two phases. Figure Q] illustrates the subse- 
quent processing performed from the perspective of Si. In the first phase, both Si and S2 concurrently 
transmit their packets x ln e j27r ^ in and x 2n e : ' 27r ^ 2n to the relayj. Unless otherwise specified, n denotes the 
discrete time index that runs from to N — 1. Since we focus only on the CFO estimation problem, 
{x in } is taken to be the preamble of source Si,i = 1,2. The relay down-converts the received signal to 
baseband by taking reference from its carrier frequency f . Hence the relay receives the baseband signal 



as 



r 0n = e~ j27rfon { {x ln e^ hn ) ® h 10 , n + {x 2n e j2 ^ n ) ® h 20 , n } + v 0ni n = 0, • • • , N - 1, 



(1) 



where vo n is the zero-mean AWGN with variance a 2 . In the second phase, the relay scales r 0n such that 
I J2n=i r on\ 2 /N equals the expected transmit power. We denote the scaling as a > 0. Then the relay 
up-converts the baseband signal to its carrier frequency / , and broadcasts it to both sources. 

'if Si and S2 do not transmit concurrently but with a small time difference, this delay is easily accommodated by introducing zeros in 
the first few samples of the channel FIR. 
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Let us focus on the subsequent processing only at source Si. The results at S 2 can be obtained similarly. 
After down-converting the received signal from its carrier frequency / l9 source Si receives in baseband 

Tm = e~^ hn { (ar 0n e^° n ) ® h m , n } + v ln (2) 

where v ln is zero-mean AWGN with variance o\. 

We have assumed that the relay performs a digital amplify-and-forward scheme, where the signals is 
scaled or amplified in the baseband. The above system model also holds if the relay performs an analogue 
amplify-and-forward scheme, where all processing is performed instead in the radio-frequency domain. 

After some algebraic manipulations, we express © as the following (more insightful) signal model: 

Tm = rn,„ + e^ h ~ h >r 21 . n + u ln (3) 

In ©, rii >n = hn t n®xi n and r 2lyTl — h2i,n®x 2n denote the equivalent received signals with their equivalent 
channels given by hn >n = ae~ j27Tfin (h i jn ®h 10 , ri ) and h 2 i, n — ae~ j27T ^ n (h i in ^)h 20jn ), respectively, while 

um = e j2 < f °-^ n { (ah 0hn e~^ fon ) ® v , n } + v ln . (4) 

Clearly, the equivalent noise Ui n depends only on the relay-to-self channel tap hoi >n . 

III. The Class of Block-Rotated Preambles (BRPs) 

Our objective in this paper is to design the preambles {xi n } and {x 2n }, so that the CFO (f 2 — fi) is 
estimated as accurately as possible. We restrict our study to the class of the block rotated preambles (BRPs) 
preambles, which is proposed in this section. As we shall see, the BRP overcomes a significant problem 
when periodic preambles are used, and when optimized, the BRP approaches the ideal performance where 
no self-interference is present. 

A. Definition 

A BRP x = [x , • • ■ , X(m + i)l_i] t of length (M + 1)L is uniquely defined by a basis block b = 
[bo, b 2 , ■ ■ ■ , bL~i} T 6 C Lxl and a block rotation angle 9 G [0, 2tt), according to 

x=[b T ,pb T ,p 2 b T ,...p M b T f (5) 
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where p = exp(j9). If 9 = and so p = 1, the BRP becomes a periodic preamble, which is used widely 
in conventional point-to-point communication systems, e.g. in H. Hence, we may treat the phase rotation 
9 as being applied to the periodic preamble on a block level, i.e., the phase remains constant for every 
block of L samples then increments by 9 for the next block. 

B, Equivalent System Model 

Henceforth, we assume source Si uses the BRP Xj with basis block bj and block rotation angle 9i, 
i = 1,2. In practice, a time synchronization algorithm is used to estimate the arrival of the preambles 
at the receiver. Any timing error does not destroy the block rotation property of the BRP as given in 
©. Hence, without loss of generality we assume perfect knowledge on the arrival of the preambles. To 
remove any possible inter-symbol interference from other transmissions, we follow the common practice 
of discarding the first L samples that constitute the first block of preambles. Thus, we have N = ML 
samples of received signal left. For notational convenience, we reset the time index n in © and © to 
start from the second basis block. From ©, after some algebraic manipulations we obtain the received 
signal at Si as 
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P 2 21 1 L 


r 2 i + 




n M l 




n M l 


G21 




G11 



where p 21 = e^ 21 , <j> 21 = 2ir(f 2 - f x )L + 2 , p u = e^ 11 , n = 9 X . We recall that N = ML and the vector 
and matrix dimensions are r^ii! e C Nxl , Gn, G 2 i € C NxL , while r n ,r 2 i G C ixl . 

We interpret the terms in the system model © as follows. The first term G 2 ir 2i is the signal vector that 
contains useful information of the CFO (/ 2 — fi) via G 21 (which depends on 21 ), while r 21 is a channel- 
related nuisance parameter that depends on hoi, h 20 . The second term Guru is the self-interference vector 
at direct current that can potentially interfere with the CFO estimation, where rn is another channel-related 



nuisance parameter that depends on h 01 , h 10 . Subsequent results will support and further clarify the above 
intuitive view. Finally, the last term corresponds to the additive coloured Gaussian noise in ©, which can 
be expressed as^ 

ui = K(h i)v + vi (7) 

where K(h i) e £Nxn+l-i has the ith rQW as [o lx(i _ 1); h i, Oi x(JV -i)], while v = [v ,-l+i, , vo,n] T 
and vi = [vi,i, Ui,2, ■ • • > v i,n] T are AWGN vectors. In ©, without loss of generality we have discarded the 
phase rotation e' 2n v°~f 1 ' n in ©, because all random distributions are assumed to be circularly symmetric. 
Given hoi, the vector ui is Gaussian distributed with mean zero and covariance matrix 

E[ Ul uf ] 4 R(h 01 ) = K(h 01 )K H (h i) + I (8) 

where without loss of generality, we assume noise variances to be one, i.e., a\ — of = 1. We note that if 

h i is random, then u x is no longer Gaussian distributed in general. 

Remark 1: If source Si does not transmit in the first phase, i.e., X\, n = for all n, then we obtain a 
one-way relaying system. The system model for one-way relaying is thus given by © with rn = Olxi- 

For subsequent derivations, it is convenient to re-write the complex-valued system model in © as the 

real-valued system model: 

r = Ar + n (9) 

T rp Re([Gn,G 2 i]) -Im([Gn,G 2 i]) 
where r = [Re(rf ), Im(rf )] , n = [Re(uf ), Im(uf )] , A = 

Im([Gn,G 21 ]) Re([G n ,G 2 i]) 

and?4[Re([rf 1 ,4 1 ]),Im([rf 1 ,4 1 ])] T 

IV. Cramer-Rao Bound (CRB) for Preamble Design 

The CRB gives the fundamental limit of the variance of any unbiased estimator Q. We focus on the 
CRB for the CFO estimation only at source Si; similar results hold for the CFO estimation at source 

2 We make the dependence of the channel hoi explicit as this leads to the key difference between the two CRBs that we will introduce 
later. 



S 2 . For simplicity, we consider the CRB of <\> 2 \ = 2n(f 2 — fi)L + 9 2 , instead of the CFO (f 2 — fi). The 
CFO is related to <\) 2 \ by a linear transformation, and hence both CRBs are related simply by a linear 
transformation Q. 

A. General Approach 

To obtain the true CRB, so-called to distinguish from the CRBs to be introduced, we have to take into 
account r which is treated as nuisance parameters. A closed-form expression for the true CRB however 
appears to be intractable. Since our aim is to design preambles, it is more useful to have closed-form 
expressions based on lower bounds, or under suitable tight approximations, of the true CRB. To this end, 
we establish two lower bounds of the true CRB, namely, the genie-aided CRB (GCRB) assuming the 
channel h 01 is known in Section IIV-BI and the modified CRB (MCRB) |[T3l assuming that h 01 is not 
known in Section HV-Cl We also obtain the approximate CRB (ACRB) in closed-form which serve as a 
good approximation for the MCRB. 

B. Genie-Aided CRB (with Perfect Knowledge of h iJ 

Theorem \T\ states the GCRB for the estimation of <p assuming knowledge of h i is available. Although 

the only desired parameter of interest is 2 i, the nuisance parameters r are also jointly estimated in the 

derivations for the GCRB. The GCRB (of 2 i) is thus derived for a given parameter set (3 = [0 2 i, r T ] T , 

and thus denoted explicitly as GCRB(0 2 i; r). 

Theorem 1: Assume that Si transmits a BRP with basis block bj and block rotation angle 9i, where 
i — 1,2, and M > 3 basis blocks are usedj. Given the received signal ri in © and the CSI hoi, the 
GCRB of (fi at source Si is 

GCRB(0 21 ;r) = (2rf 1 Gf 1 T$ 1 TG 21 r 21 - 2rf 1 * 1 r 21 )" 1 (10) 

where T = diag(0, 1, 2, . . . M - 1) ® 1 L and 

4 Rr 1 - R^Gai (G^R^Gai)" 1 GgR" 1 , (11) 

^ GgTSxGn (GgSxGu^Gf^TGaL (12) 

3 See Corollary Q] later which states that the GCRB for M = 2 is not well defined. 
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Moreover, $ x satisfies <E»iG 2 i = NLxL . 

Proof: See Appendix lAl ■ 
Remark 2: In the GCRB, the nuisance parameters are treated as deterministic. The true CRB, however, 
treats r as random. A lower bound of the true CRB is given by the extended Miller-and-Chang bound 
(EMCB) Er[GCRB(02i; r)], i.e., the expectation of the GCRB over the nuisance parameters r ifTOl . The 
EMCB is obtained assuming the channel h i is known. This bound is also a lower bound for the true 
CRB assuming hoi is not known, since this knowledge can always be discarded even if available to give 
the same estimator performance. 

From Remark [2l Theorem \T\ provides a lower bound for the true CRB, whether hoi is known or not. 

Next, Theorem |2] states the necessary condition for the true CRB to be bounded. 

Theorem 2: Assume the same conditions as in Theorem \T\ with Si and S<2 transmitting periodic (but 
possibly different) preambles, i.e., 6i = 9 2 = 0. Then the GCRB, and also the true CRB whether the 
channel h i is known or not, are unbounded as (/ 2 — fi) — > for any h 01 . 

Proof: Appendix |B] proves that GCRB(0 21 ;r) — > oo as (f 2 — fi) — > for any r. By Remark [2l the 

true CRB is also unbounded asymptotically, whether the channel h i is known or not, ■ 

Theorem [2] shows that using periodic preambles (even different ones) at both sources leads to an 
unbounded true CRB if the carrier frequencies of these two sources are the same. This result suggests 
that the problem of CFO estimation in two-way relay systems is similar in nature to the problem where 
two carrier frequencies are present and their values have to be estimated; in the latter problem, the CRBs 
of estimating the two carrier frequencies is arbitrarily large as the frequencies approach each other (8]|. 

In practice, the CFO (/ 2 — fi) approaches zero by design but may not be exactly zero. Since the GCRB 
is a continuous function of the CFO, the CRB will still be large if the CFO is small. Thus, reusing 
conventional periodic preambles at both sources can lead to the potentially catastrophic scenario where 
the CFO effectively cannot be estimated, as confirmed also by numerical results in Section I VTll 

Theorem Q] assumes that M > 3 basis blocks are used. Corollary Q] shows that the GCRB is not well 
defined if M = 2. 
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Corollary 1: Assume the same conditions as in Theorem [T] Then the GCRB is undefined if M = 2 
for any h i. 

Proof: See Appendix O ■ 
Corollary \T\ suggests that the minimum number of basis blocks to be used is three. We note that 
Corollary \T\ holds for any BRP, including (conventional) periodic preambles. This is somewhat surprising, 
since for point-to-point and even one-way relay systems, two blocks of periodic preambles are sufficient 
for CFO estimation, see for example IfTTI . Intuitively, this is because the degrees of freedom are insufficient 
when M = 2. Specifically, each of the two carrier frequencies is corrupted by a complex-valued attenuation, 
which result in a total of six (desired or nuisance) real- valued parameters. Consider the extreme case of 
a flat-fading channel and L — 1 symbol is present in each basis block. Then M = 2 basis blocks give 
only two complex-valued received signals, or four real-valued received signals, which are insufficient to 
estimate all the six parameters. On the other hand, M = 3 basis blocks give just sufficient number of 
received signals to estimate all six parameters. 

In view of Corollary \T\ we assume henceforth that M > 3. Since N = ML and L > 1, we have N > 3. 

C. Modified CRB (without Knowledge of h i) 

The GCRB in (flOl ) assumes knowledge of the channel h i, which provides some insights to the true 
CRB. In this section, we assume that h i is not known but random with some known distribution, which 
is a more reasonable assumption in practice. Similar to Section IIV-B[ we assume a given (deterministic) 
parameter set (3 comprising the parameter of interest <p 2 i and the nuisance parameters r. We shall employ 
the MCRB 021 to give a lower bound for the true CRB. 

Before specializing to our system, let us consider the following general real-valued system model: 

r = zip) + He (13) 

where r, z and e are of length n, and H is a square matrix. In (fT3l . r is the received signal, z(/3) = 
[zi((3), ■ ■ ■ , z N ((3)] depends only on the length-p parameter vector (3 — \fii, - ■ ■ ,/3 p ], while H and e are 
random with known distributions. 



12 



Lemma 1: Given r in (fT3~l) , a lower bound for the variance of any unbiased estimator for /3 is where 
i — 1, • • ■ , p, is given by the MCRB 

MCRB(&;{&J^O)= [Fm]« (14) 

(9o(/5h do(0)] 

where F M = E r yy Z' is the modified FIM, g(/3) = log / r | H (r; /3|H) is the conditional log- 

9/3 <9/3 T 

likelihood function, and [X] i? ; denotes the ith diagonal element of the matrix X. If the following assump- 
tions hold: 

Al: H is full rank with probability one; 

A2: the elements of e are i.i.d. (not necessarily Gaussian distributed); 

A3: e and H are independent of each other, and also both are independent of /3, 

then the modified FIM simplifies as 

F M 0)= 1 Z'(P)TZ' T 0) (15) 
where Z' is a p-by-iV matrix with the (i, j)th element as dzj{(3) / d(3i. In (TT5T ), 7 is a scalar that depends 



only on the distribution of e, while Y = Eh 



-1 



depends only on the distribution of H. 



(HH 5 )' 

Proof: Taking H as the nuisance parameter, and applying the results in [13], the MCRB (PT4l) follows 

immediately as a lower bound to the variance of the estimator of The simplification of the modified 

FIM (PT41) to (fT5l) under assumptions is given in Appendix ID] ■ 

We now apply Lemma Q] to obtain a lower bound for the estimation of 2 i given r x in ©. Theorem [3] 

states the MCRB in general which is then expressed in closed-form for a special case. 

Theorem 3: Assume that Si transmits a BRP with basis block bj and block rotation angle 9,i, where 
% — 1, 2, for M > 3. Then a lower bound for the variance of any unbiased estimator for 21 is given by 
the MCRB 

MCRB(0 21 ;r)= [F^ (16) 



where the modified FIM is given by (1151) with the general system model in (1131) specialized to the system 
model in ©. Specifically, we use the substitutions j3 = [0 2 i, r T ] T and z(/3) = Ar as in ([9]), which gives^ 

4 From J8}, R _1 (hoi) is always invertible. Hence, T exists. 
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Fig. 2. Graph of degradation function X(x) for M = 3, • • ■ ,6. 



r = Eh m [R 1 (h i)]. If r = fcl for some constant k > 0, then the MCRB can be expressed in closed-form 

5 



as 



ACRB(0 21 ;r) 



6A; 



where the degradation function X(x) is given by 

[Mcos (Mx/2) sin (x/2) - sin (Mx/2) cos 0/2)] 2 



(17) 



X(x) 



sm 



(Mx/2)] 



(18) 



sin 2 (x/2) [M 2 sin 2 (x/2) 
Clearly, ACRB(0 2 i;r) strictly increases as A increases. 

Proof: See Appendix [0 ■ 
Remark 3: It can be easily verified that the degradation function A(-) is non-negative, symmetric, i.e., 
\(x) = X(—x), and periodic with period 2ir, i.e, X(x) = X(x + 2nk) for any integer k. A plot of A(-) is 
given in Fig. [2] for M > 3. 

Although the general MCRB (fT6l) is an exact lower bound, the expression appears to be intractable. 

In practice, the matrix T is well approximated by a scaled identity matrix. For example, based on the 

simulation conditions in Section IVIIl the magnitudes of some rows of T are plotted in Fig. [3] We see 



This special MCRB shall be used to denote an approximation of the MCRB later, hence we use the acronym ACRB. 
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Fig. 3. A plot of the magnitudes of some rows of T. 

that the diagonal elements of T are almost the same, while the off-diagonal elements are very close 
to zero. Thus, for our problem of interest, the MCRB in (PT71) is in fact a good approximation of the 
exact MCRB given by the non-closed-form (TT6l) . Intuitively, the approximation is accurate because of 
the following observations. If the matrix K in CZl) is a circulant matrix, then from the fact that circulant 
matrices are diagonalizable by the Fourier matrix, it can be shown that T equals a scaled identity matrix 
after performing expectation over h i. Since K can be written as a circulant matrix plus a sparse matrix 
with typically small-magnitude entries, we expect that K is close to being circulant, and both the MCRBs 
are thus approximately the same. Further numerical results from Section IVIII will support the accuracy of 
this approximation. 

Henceforth, we refer to (fT7l) as the approximate CRB (ACRB). As shown in the discussions, the ACRB 
is in closed-form and provides a good approximation of the MCRB if T is well approximated by a scaled 
matrix. To obtain further analytical insights, we focus on the ACRB. 

Our system model covers the case of one-way relaying viz. Remark [T] Theorem |4] states the ACRB for 
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one-way relaying, denoted as ACRB 1-way (</> 2 i; r), which serves as the benchmark for two-way relaying. 
Theorem 4: Suppose the BRP is used by source Si for one-way relaying. Then the ACRB of </> 2 i is 

ACRB 1_way (^ 2 i;r) = m , _ 1)(i , gi , 2i) (19) 

and ACRB 1_way (02i; r) < ACRB(0 21 ; r) with equality if and only if the degradation function A(-) equals 
zero. 

Proof: We omit the proof which is similar to that for Theorem [3] The inequality follows because the 
degradation function is non-negative. ■ 
The inequality in Theorem[4]is intuitively expected since in one-way relaying, there is no self-interference 
to degrade the performance of the estimator. More interestingly, Theorem |4] suggests that the ACRB for 
two-way relaying can achieve the lower bound if the degradation function can be made to be zero. This 
observation partly motivates the BRP design in the next section. 

V. Optimization of BRP Parameters 

In this section, we optimize the parameters of the BRP for both sources, namely, the basis blocks 
bi, b 2 , and the block rotation angles #1, 9 2 , such that the ACRB is minimized. 

We consider the following optimization problem to minimize the ACRB given in (fTTT ): 

min ACRB(0 21 ;r) (20a) 

bi,b2,0i,02 

subject to 6i G (0, 2vr] and bf b; < P for i = 1, 2 (20b) 

where P, represents the power constraint for the BRP sent by source Si. Although we do not explicitly 
consider minimizing the peak-to-average power ratio (PAPR) of the BRP, we shall see in Remark |4] later 
the optimal solution also minimizes the PAPR. 

From the closed-form solution ( fTTT ), the ACRB is proportional to the inverse of (k — A(0 2 i — 0ii))(r^r 21 ) 
where A; is a positive constant such that the first product is positive. Thus, the minimization in (I20al) depends 
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on these joint optimization problems 



max r^r2i 

bl ,132,0! ,02 

min A(A) 

0<A<2tt 



(21a) 
(21b) 



where (12 1 at) is subject to (I20bl) and A = 6 2 — Oi in (I21bl) is the difference of the block rotation angles. 
In <121b|) , it is sufficient to perform the optimization over < A < 2n as A(-) is periodic with period 2ir. 

Due to the presence of the variables 6 l 2 ,6 l i in both optimizations in (I2TT) . to solve (1201) optimally we 
have to consider both optimizations jointly in general. In Section |V-A[ however, we shall see that the 
optimization in (|21al) depends only on the basis block b 2 ; when we optimize the ACRB at source S 2 instead 
of Si here, the optimization then depends only on t>i only. This observation then allows us to decouple 
the two optimization problems in (I2TI) . In Section |V-B[ we shall thus only consider the optimization of 
the angle difference A in (I21bl) . We denote all optimal parameters with the superscript *. 

A. Optimization of Basis Blocks 



Consider the optimization problem (12 1 ab subject to (|20b| ). From the definition r 21>n = h 2 i n ® x 2n in 
Section HH we can write r 2 i = S(b 2 ,6 l 2 )hi 2 where 



3(b 2 ,0 2 ; 



&20 

hi 



b 2 T-ie-^ b 2 r 2 e~^ 



6 2 ie"^ 2 



'20 



>2,L 



_ ie -J92 ... ^-302 



i>2,L-l b 2L _ 2 &2.L-3 

depends only on b 2 , 9 2 . Thus the optimization problem (12 1 ab becomes 



max ||H(b 2 , 6 2 )hi 2 \ 

b 2 ,6> 2 



>20 



(22) 



subject to 9 2 e (0, 2rc] and b^b 2 < P 2 . 



(23a) 
(23b) 



Theorem \5\ later states that the optimal solution is given by modifying the well-known CAZAC sequence 
with some pre-determined phase shifts. A length-L sequence (written as a vector for convenience) b = 
[bo, • • • , &l-i] t is said to be a CAZAC sequence if it satisfies the following properties ffT2l : 
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• constant amplitude: \b,j\ equals a constant for % — 0, ■ • • , L — 1. 

• cyclic-shift orthogonality: the zth cyclic shift of b is orthogonal to the jth cyclic shift of b for i ^ j. 

Given the CAZAC sequence b and an angle parameter 9, we define the generalized CAZAC sequence 

as b' = [&!,■■• , b' L ] T where b' n = b n e jnd ^ L , n — 0, - ■ • ,L — 1. Clearly, if we choose the angle parameter 

9 = 0, the generalized CAZAC sequence specializes to the conventional CAZAC sequence. 

Theorem 5: For the optimization problem (|23l ), the optimal block rotation angle #2 can take any arbitrary 
angle, while the optimal basis block is given by the generalized CAZAC sequence with the angle 
parameter 9 set as the chosen block rotation angle 9\. 

Proof: The objective function in (I23al) can be upper bounded as follows: 



|E(b 2 ,0 2 )hi2| 



L-1 

,rn 



(24) 



m=0 
L-1 

< ||£m^l2,m|| 2 (25) 
m=0 

L-1 L-1 L-1 L-1 

< ^2 \\€m\\ 2 ^2 I^12,n| 2 = ^2 l^ 2m | 2 ^2 l^ 12 '"| 2 ( 26 ^ 
m=0 n=0 m=0 n=0 

L-1 

<Pj2\ k ^\ 2 (27) 

n=0 

where ^ is the zth column of S(b, 9 2 ); the first and second inequalities follow from the triangle inequality 

and the Cauchy-Schwarz inequality, respectively; the last inequality is due to the constraint (I23bl) . Now 

if we use the generalized CAZAC sequence (with power constraint P) as the basis block for the BRP, 

and we set the angle parameter as the block rotation angle 9 2 , then we check that the objective function 

achieves the above bound for any choice of 9 2 . It follows that the stated solutions are optimal. ■ 
Remark 4: Since the generalized CAZAC sequence is obtained from the conventional CAZAC sequence 
with multiplication of a phase term, it retains the desirable property of having constant amplitude. This 
minimizes the PAPR of the transmitted sequence which is a desirable property in preamble designs, see 
e.g. BUl, lfT4l which consider the preamble design for point-to-point channels. 
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B. Optimization of Block-Rotation Angles 

We have shown in Theorem [5] that the optimization problem (12 1 al) does not depend on the the block 
rotation angles. To solve (1201) completely, this section solves the optimization problem (I21bl) . 

In Theorem [2l we observe that the catastrophic case of unbounded CRB occurs if /i = / 2 . Hence, we 

first focus on this case. Theorem [6] states the necessary and sufficient conditions for A* to be optimal 

assuming fi = f 2 . Theorem [7] next considers the case where f 2 approaches f\, but not necessarily fi = f 2 . 

Theorem 6: If fi = f 2 , the necessary and sufficient conditions for A, where < A < 2n, to be an 
optimal solution for the optimization problem (I21bl) are given by 

A y£ (28a) 
Mcos(MA/2)sin(A/2) = sin(MA/2) cos(A/2). (28b) 

Proof: Denote the numerator and denominator of A (A) in (fT8l as N(A) and D(A), respectively, 
where < A < 2ix. It is useful to observe that -D(A) = iff A = for M > 3; this follows from 
Lemma [2] in Appendix E and that sin 2 (A/2) = iff A = 0. Note also that A(A) = JV(A)/D(A) = if 
N(A) = and D(A) ^ 0. 

Since A(-) > 0, to prove Theorem [6l it suffices to show that A(A) = iff ([28]) holds. Note that (|28a| ) 
implies D(A) ^ 0, while (l28bl) implies JV(A) = 0. Thus, (EHl) implies A(A) = 0. 

Next, we show the converse, i.e., A(A) = implies (|28l) . We first show by contradiction that (|28a|) , 
i.e., A 7^ 0, must hold, assuming A(A) = 0. Suppose that A = 0. Then N(A) = 0, and by the well- 
known L'Hospital's rule A(0) = \im^ N(A) / D(A) is strictly positive. This contradicts the assumption 
A(A) = 0. We conclude that A ^ if A(A) = 0. Since A ^ 0, from the earlier observation D(A) ^ 0. 
Thus, A(A) = implies N(A) = or equivalently (I28bl) . Combining the two conditions A/0 and (I28bl) . 

we complete the converse part of the proof. ■ 
Remark 5: If M > 3 is odd, the solution A* = n satisfies (|28T) . By Theorem [6l A* is an optimal solution 
of (I21bl) assuming fi = f 2 . Thus, A* is independent of M, which is desirable if M is not known, e.g., 
due to uncertainty in timing synchronization. Intuitively, this choice of A* creates an artificial frequency 
offset that is maximally possible, since the degradation function is periodic with period 2n. 
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Fig. 4. Graph of degradation function A(A(M)) vs number of basis blocks M for even M > 4, where A(M) = (1 - M/(M 2 - l))ir 
approximates the optimal solution A*(M). 



Remark 6: If M > 4 is even, no closed-form optimal solution is readily available. For small M, we 
solve (T28l) to obtain 

arccos(-2/3) , for M = 4, 

A*(M) = < arccos(-(9 + rf 1 )/(12 + d 1 )), for M = 6, (29) 
arccos(-(4 + d 2 )/(5 + di)) , for M = 8, 

v 

where d x = 2a/2T and d 2 = 2-\/T5cos (arccos (19/ (5y/TE)) /3). For arbitrary even M, a good approx- 
imate solution can be heuristically obtained as A(M) = (1 — M/(M 2 — l))ir. Numerical results based 
on A(M) are given in Fig. HI which shows that the degradation function A(A(M)) is very close to zero 
and decreases quickly with increasing M. 

We next consider the case where f 2 ~ ft but f 2 ^ fx. We expect the CFO to be small because the 
carrier frequencies are typically selected to be close to some designated carrier frequency, or a ranging 
procedure is performed before two-way relaying to calibrate the carrier frequencies. Theorem [7] obtains 
the optimal A* that satisfies (I21bl) such that it also minimizes the (small) perturbation of the degradation 
function around the neighborhood of fi and f 2 . To obtain an explicit closed-form solution, we focus on 
M that is odd. 
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Theorem 7: Let 5 = 2n(f 2 — f\)L and A = 9 2 — Q\. For small 5, the degradation function is given by 

X(A + 5) = X(A)+ Pl 5 + p 2 5 2 + 0(5 3 ), (30) 
M 2 — 1 

where p± = and p 2 = 9 For odd M > 3, the optimal A* that satisfies (I21bl) and minimizes 

4 sin (A/2) 

the second-order perturbation term p 2 is uniquely given by A* = n. 

Proof: After some calculus and algebraic manipulations, the first and second derivatives of A (A) can 
be obtained as A'(A) = pi = and A"(A) = 2p 2 , respectively. The Taylor series of A(A + 5) at A for 
small 5 can then be obtained as given in (1301). We have p 2 > (M 2 - l)/4 with equality iff sin 2 (A/2) = 1, 
i.e., A = 7r. The proof is completed by noting that A = ix also satisfies the optimality in (128T ). ■ 
From Theorem [71 by choosing odd M > 3 and A = n, the degradation function is exactly zero if 
h — fi, an d me first-order perturbation term pi is also zero for small perturbation in the CFO (f 2 — fx). 
Thus, up to a second order perturbation term of the CFO, the angle difference A* = n is optimal. 

VI. CRB-Preserving Implementation 

In this section, we design an estimator that achieves the ACRB that is optimized in Section |V] Although 
the analysis of the CRB for two-way relaying is rather involved, we shall show that CFO estimators that 
achieve the CRB in the point-to-point channel suffices for the two-way relay channel. 

To achieve low complexity in implementation, we introduce a simple preprocessing to reduce the 
received signal vector to a more familiar form. Consider the linear filter Q £ Q^Lxm w ^ orthogonality 
property that 

Q*Gu = <W, (31) 

where the integer m, where 1 < m < (N — l)L, is a design parameter. The matrix Q always exists since 
the nullity of G{\ (of rank L) is NL — L. Applying the linear filter on the received signal vector r x in 
©, we get 

Zl 4 Q% = Q"G 21 r 21 + Q H Ul . (32) 
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We shall use Zj, instead of r^, for CFO estimation. Despite the low complexity, Theorem [8] states that 
there is no loss of optimality in the GCRB if Q a CRB-preserving filter that satisfies these two conditions: 
(i) m — {N- 1)L and (ii) Q is of full rank. 

Theorem 8: Consider the class of linear filter Q with parameter m, where 1 < m < (N — i)L, such 
that (13TI) holds. If a CRB-preserving filter is used, then the GCRB of </> 2 i using the received signal ri, 
given by (flOl ), is the same as the GCRB using instead the filtered signals zi in (T32j 



X — XG21 (G^XG2i) G^X 



TG21T21 



Proof: Denote the function /(X) = ^2r£G£T 
Following the proof in Appendix lAl the GCRB of 2 i using z 1 in (|32l ) can be derived as /($i), where 
*i = Q (Q H RQ) _1 Q H . The GCRB in (DSl) can be alternatively expressed as GCRB(0 2 i;r) = f(Qi), 
where fii = R 1 - R _1 Gn (GgR^GuJ^GgRr 1 . Consider the difference A = Oi - ¥1 = 
R _1 -Xi (Xf RXi) -1 Xf , where Xi = [R -1 Gn, Q] e C NLx ^ L+m \ Since R 1 is of rank NL, whereas 
Xi (X^RXi) 1 Xf is of rank L + m, the sufficient and necessary condition for A to be zero is to make 
Xi full rank, or equivalently, m — (N — 1)L, and Q is of full rank. When these two conditions hold, Xi 
is invertible and A = Onlxnl- Thus $ x = Q 1 , implying the two GCRBs are the same. ■ 
Theorem [8] suggests that the filtered signal z\ is as good as the original received signal ri for CFO 
estimation, assuming knowledge of the channel h i. This is supported by numerical results in Section IVTI1 
even if h 01 is not known. 

Next, we give a specific realization of the CRB-preserving filter. Any Q that spans the nullspace of 
Gfy must satisfy (T3T1) . Thus there are infinitely many possible matrices Q that result in no loss in CRB. 
A specific choice of filter for source Si that satisfies the conditions in Theorem [8] is 



H 



Pll l L -I L L ■■■ L 
Ol PiJ-l -II 

L ■■■ L p u l L -l L . 



(33) 



We can interpret the filter Q as a blockwise low pass filter with coefficients {pn, —1} that operate on 
the received signal in two blocks of L samples. The advantage of using this filter is that it leads to low 
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implementation complexity, comparable to point-to-point communication systems. Observe that the filtered 
vector zi can be interpreted to have an equivalent channel Q H G 2 i = (1 — P2i)G 2 i with an equivalent 
Gaussian noise Qfui. The signal model is the same as a point-to-point communication system where a 
periodic preamble is sent, experiences a CFO of 9 2 , and is received with Gaussian noise with correlation 
matrix Qf Qi. Hence, we can use any CRB-achieving estimator for CFO estimation in point-to-point 
communication, and yet achieves the CRB for two-way relaying. 

VII. Simulation Results And Discussions 

Without loss of generality, we consider the estimation at source Si. In this section, we shall see 
that the proposed BRP design can achieve a mean-squared-error (MSE) performance that is close to the 
EMCB, which is the fundamental lower bound according to Remark |2j Specifically, the EMCB is obtained 
numerically by averaging the GCRB assuming knowledge of h i is available. To obtain the numerical 
results for the MSE, we consider two specific estimators, namely the correlator, see e.g. Ifl31 . and the 
genie-aided maximum likelihood estimator (GA-MLE). They represent schemes with very low and very 
high complexity, respectively, and both are commonly used. Both estimators work on the output zi of the 
blockwise linear filter in (|33l ), obtained based on (|32l) , to take advantage of the fact that the signal is free 
of self-interference and the link becomes a point-to-point channel. 

The correlator estimates 02i as </>cor = ^ J2m=o z im z hm+u where zi m is a column vector that collects 
elements of Zi in (|32l ) with indices {mL, . . . , (m + 1)L — 1}. The GA-MLE, on the other hand, is given 
the channel h i and hence knows the noise covariance matrix given by ©. Thus the noise ui is known to 
be Gaussian distributed, and this estimator then performs conventional ML estimation to obtain the CFO. 
In practice, h 01 is not known and hence the estimator provides an optimistic MSE performance that may 
not be realized in practice. 

We assume the following simulation setup. We use M = 5 basis blocks and L = 16 samples in each 
basis block. All channel taps experience independent Rayleigh fading of magnitude regulated by a 8-tap 
exponential power delay profile fee - ™/ 7 ™ 8 , where n is the tap index and A; is a normalizing constant. We 
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choose r rms = 1. For simplicity, it is assumed that the two sources Si and S 2 communicate with equal 
power P, and the variance of the AWGN a\ at all receivers are identical. The signal-to-noise ratio is 
defined as Pjo\. The scaling factor at the relay is set such that the total transmission power is 2P. The 
carrier frequencies of the two sources relative to the carrier frequency of the relay are (arbitrarily) set to 
fi = 0.001 and f 2 = -0.002. 

In Figure |5l we plot the EMCB given by averaging the GCRB for the following three cases: (i) two-way 
relaying with periodic preambles (with no marker); (ii) two-way relaying with BRP optimized according 
to Section [V] (with marker " + "); (iii) one-way relaying with periodic preambles (with marker " x "). We 
observe that the first case has fairly large averaged GCRB, which is expected according to Theorem [21 
With the optimized BRP, however, the averaged GCRB is reduced substantially by about twenty times. 
Moreover, the averaged GCRB for both two-way relaying and one-way relaying are almost the same when 
the optimized BRP is used, as suggested by Theorem @] based on the ACRB. 

Theorem [8] states that a CRB-preserving filter with the optimized BRP allow the GCRB to be approached. 
To check this, in Figure \5\ we also plot the MSE for two-way relaying where we use the optimized BRP 
and the following estimators: (i) correlator (with marker "□"); (ii) GA-MLE (with marker " o "). We 
observe that the GA-MLE performs close to the averaged GCRB for two-way relaying with optimized 
BRP. This is expected when knowledge of h i is available, according to Theorem [8] At high SNR, the 
correlator performs close to the GA-MLE. This suggests that the additional knowledge of the channel h 01 
does not improve the MSE performance, and so a low complexity estimator suffices. We note that the 
correlator outperforms the averaged GCRB at SNRs lower than about 7 dB. This is because the correlator 
is not an unbiased estimator as we have numerically confirmed. Nevertheless, in the high-SNR regime 
the correlator becomes asymptotically unbiased, and so the MSE still becomes lower bounded by the 
averaged GCRB. 
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SNR (dB) 

Fig. 5. Mean-square-error (MSE) and the averaged GCRB for CFO estimation. 

VIII. Conclusions 

A novel block-rotation-based preamble (BRP) design for CFO estimation in amplify-and-forward two- 
way relaying systems has been proposed. The BRP can be viewed as a generalization of the conventional 
periodic preamble widely used in practice. Intuitively, the BRP creates an artificial block-level frequency 
offset so as to distinguish the carrier frequencies of the two sources. Our analysis on the fundamental lower 
bound of the MSE performance allows us to identify the catastrophic case when the CRB is unbounded 
or fails to exist, which has not been identified in the literature so far. Also, our analysis provides practical 
guidelines to design BRPs that perform close to the fundamental lower bound. Finally, since the carrier 
frequency of the relay does not affect the analysis, the proposed BRP design and estimation schemes appear 
to be readily applicable to communication systems with more than one relay, and also to multiple- antenna 
communication systems. 
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For convenience, let W 



Appendix A 
Proof of Theorem Q] 
Re(R~ 1 ) -In^Rr 1 

In^R" 1 ) Re(R- 1 



A 9A ~ 



Re(jTG 21 r 21 ) 
Im(jTG 21 r 21 ; 



and T 



diag(0, 1, ■ • • , M - 1) ® I L . 

The parameters to be estimated are (3 = [02i> r n> r li] T - Given (3, and since h i is given, the received 
signal ri in ® has the Gaussian distribution 



f ri (r i; (3) = ir~ N det(W) exp [- (r - Arf W (r - A?) 



(34) 



The Fisher Information Matrix (FIM) is thus 



F(hoi) = E ri 



/ 



V 



aiog[/ ri (r i; /3)] 



'21 



dlog [frAn; {3)] 
dr 



01og[/ ri (r i; /3)] 

5021 

<91og[/ ri (r i; /3)] 
dr 







q ff Wq 


q H WA 










= 2 






• (35) 


. ) 




A^Wq 


A ff WA 





The CRB matrix, given by the inverse of the FIM, can then be obtained. The (1, l)th element of the CRB 
matrix can be isolated, by the use of the block matrix inversion lemma, to give 

1 



GCRB(0 21 ;r) 



2 q H [W - WA (A^WA) -1 A^W] q' 
After some algebraic manipulations, we have 

Re(M) -Im(M) 
Im(M) Re(M) 



(36) 



WA (A fl WA) X A^W 



(37) 



where 



M = R 1 



Gu G 



21 



^21 



R 1 



Gu G 



21 



\ 



/ 



-1 



R G 2i (G 2 ^R~ 1 G 2 i) 1 G^R" 1 + ^xGn (Gf^G^) 1 Gf^ 



^21 
1 riH. 



R 1 



(38) 



and $i is given in ([Tfl> . From CCD and (I36l)-(l38l). we obtain CEIl and (fT2|). From (fT2T> . clearly $i satisfies 
$iG 2 i = OAr ixi . 
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Appendix B 
Proof of Theorem [2] 

From the Taylor's series e^ e = 1 + jO + 0(9 2 ) for small 9, we can express 



Gu = (I-j( 



IT) G 2 i + O ((02i - 0ii) 2 Wl) 



(39) 



From Theorem \T\ we have $iG 2 i = Onlxl- Thus from (1391) we get 



'21 



i 11 ) 2 Gf 1 T# 1 TG 21 + O ((0 21 - u ) 3 Il) 



Gf^iTG 



21 = J 1021 



'11 



lGf 1 T* 1 TG 2 i + O ((02i - 0ii) 2 Il) 



(40) 
(41) 



which imply *i = Gf 1 T*iTG 2 i+0 (|0 2 i - 0h|Il). Since 9 V = 9 2 = 0, the denominator of GCRB(0 21 ; r) 
in (flOl) . i.e., 2r^ (G^T$iTG 2 i — \I>i) r 2 i, approaches zero as f 2 — fi — (0 2 i — 0i 2 )/(27rL) approaches 
zero. Thus, we have proved GCRB(0 2 i; r) — > oo as (/ 2 — /i) -4-0, which holds for any r. 

Appendix C 
Proof of Corollary Q] 



Suppose M = 2, thus G 2 i = [P21U, pliUf- Let P = \ 



G21G21 



, where G 2 i = [l L , -p2iU] T - Clearly 



PP H = I 2 l- Also, let $1 = QDQ W be the SVD of $1. We first show that we can express 

$1 = G 2 iB 2 iG^ 



(42) 



where B 21 4 i (W 21 D 1 Wf 1 + W 22 D 2 Wg). 

Since both P, Q are unitary matrices, there always exists a unitary matrix W such that Q = PW. 
It can then be shown that Q i/ $ 1 G 2 i = DQ^G^ = DW H P^G 2 i. From Theorem HI we know that 
$iG 2 i = 2 lxl, i-e., DW H P H G 21 = 2LxL . It can be easily verified that P^G 2 i = L f, thus we 



get 



where D 



Di L 



l D 2 

(1431) into $1 = QDQ H then leads to ((421). 



G C 2Lx2L and W 



D l W^ = L ,^ = l,2, 
Wn W12 
W 2 i w 22 



(43) 



e C 2Lx2L . Substituting Q = PW and 
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Using (I42T). it can then be verified that G^*iGn = 1 1 - e j ^ 21 - <t>ll) | 2 B 2 i. It follows that *i = 
G^T$iTG 2 i, i.e., the denominator of GCRB(</> 21 ; r) in (fTOt equals zero, independent of h i. Thus the 
CRB is undefined for any channel h i. 

Appendix D 
Proof of (IT3T ) in Lemma [H 

Let e = He. We write g(0) = log/ r | H (r; /3|H) as 

0(/3) = log/s(r-z(/3)|H) (44) 
= log|J(H)| x /.(HT^r-zOg))- 1 ) (45) 

iV 

= log|J(H)| + ^log/ £n (e n ) (46) 

n=l 

where (1441) follows from the independence of the random variables, (1431 ) follows from the transformation 
of e to e via the full-rank H and J(H) is the corresponding Jacobian, and (1461) follows from the i.i.d. 
assumption of e = r — z(/3) -1 . After some algebraic manipulations, we get 

dg0) 



(47) 



8(3 tifeniQ 

where we denote / e ' n (e n ) = 9f 6n (e n )/de n and is the nth row of H" 1 , while the definition of Z'((3) 
appears after (IT31) . 



We now obtain the modified FIM F 



M 



/e«( e n) 



E 



H 



Oat x1 due to E e 



= E r 

fe n ( e n) 



W) W) Letp 

9/3 <9/3 T J 

= Co ^' ?l ( e «) de ™ = 5 (.Co /^( e «) de n)/<9e n = 0. Using 



. Let p n £ £gyh n . Note that E H , e [p, 



this observation and (147I) . the modified FIM simplifies as 



F M (/3) = Z'(/3)-E: 



H,e 



iV 



n=l 



Z' T (/3). 



(48) 



In ( |48l ), we use the fact that Z'(/3) is independent of H and e. The middle matrix can be written as 



E 



H,e 



N 



n=l 



N 



n=l 



7 r 



where 7 ^E £ ( ^Pi 
Hence, we obtain (PT51) . 



andr^En=iE H |hX 



, and it can be readily checked that T = E H 



(49) 



(HH H \ 
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Appendix E 
Proof of Theorem [3] 

We use the substitution (3 = [0 2 i, r T ] T and z(/3) = Ar as in ®. Next, we express the noise vector Ux 
in © as He, such that assumptions hold. Given h i, the noise vector in ([7]) is Gaussian distributed 

with zero mean and co variance matrix R(h i) = K(hoi)K H (h i) + 1 which is full rank with probability 
one. Let the eigendecomposition of the covariance matrix be R(h i) = UAU^; also let A 1 / 2 be a 
diagonal matrix with diagonal elements given by the square root of the corresponding diagonal elements 
in A. Without loss of generality, we can express ui in © instead as 

Ul = H(h i)e, (50) 

where H(h i) = UA 1 / 2 and e is an i.i.d. zero-mean unit- variance complex- valued Gaussian vector that 
is independent of hoi. This is because both representations of Ui are statistically equivalent. Taking h i 
to be random in general, we see that assumptions Al to AA always hold. Applying the system model in 
(fT3l) . the MFIM is given by ([TBI where after some tedious but straightforward algebraic manipulations, 
we obtain V = E hoi [R _1 (h 01 )]. This proves the first part of Theorem |3j 

For the second part of the proof, suppose that V = kl for some constant k > 0. Then the MFIM and the 
MCRB can be obtained similarly as given by the FIM (|35l ) and GCRB (|36l ) in Appendix [A] respectively, 
with the covariance matrix R replaced by (l/k)I. With this substitution R = (l/fc)I, after some tedious 
but straightforward algebraic manipulations, we obtain the closed-form expression (fTTT) . 

Appendix F 
An Auxillary Lemma to Prove Theorem [6] 

Lemma 2: For integer M > 3, M 2 sin 2 (x/2) = sin 2 (Mx/2) holds iff x G S = {x = 2kn, k e Z}. 

Proof: We consider the case of odd M and even M separately. 

Assume that integer M > 3 is odd. We use the well-known identity of the Dirichlet kernel sm ^ n + l j£> x ) = 
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l + 2£ 



n 



cos(kx). By substituting n 



(M — l)/2 and squaring, we get 



k=l 




(51) 



with equality iff cos(kx) = 1 for k = 1, 2, • • • , (M — l)/2, i.e., x G 5. 

Assume that integer M > 4 is even. By the double angle formula sin(2x) = 2sin(x) cos(x), we get 

M/2 

sin(Ma;/2) = Msin(x ) Y[ cos{kx/2). 

k=l 

Thus, sin 2 (Mx/2) < M 2 sin 2 (x) with equality iff cos 2 (A;x/2) = 1 for k = 1, • • • , M/2, i.e., a:e5. ■ 
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